The paper establishes and verifies the theory, widely held among image and pattern
recognition specialists, that the bottom-up indirect regional matching process is more stable and
more robust than the global matching process against concentrated types of noise such as
clutter, outliers or occlusion in the imagery. We demonstrate this by analyzing the effect
of concentrated noise on a typical decision making process, a simplified two-candidate voting
model, where our theorems establish that the lower bounds to a critical breakdown point of the
election (or decision) result by the bottom-up matching process are greater than the exact bound
of the global matching process. This implies that the regional process is capable of accommodating
a higher level of noise than the global process before the result of the decision overturns.

We present an experimental verification supporting not only the theory, by a white-black
flag recognition problem in the presence of localized noise, but also the validity of the
conjecture, by a facial recognition problem, that the theorem remains valid for other decision
making processes involving an important dimension-reducing transform such as principal component
analysis or a Gabor transform.

Categories and Subject Descriptors: I.2.10 [Artificial Intelligence]: Vision and Scene
Understanding - Video analysis; I.5.2 [Pattern Recognition]: Design Methodology - Classifier design
and evaluation; H.1 [Information Systems]: Models and Principles

General Terms: Stability, Voting 

Additional Key Words and Phrases: Global Matching, Regional Matching, Noise, Pattern Recognition



1. INTRODUCTION 

Consider some decision making process G which arises quite frequently in scientific
research as well as in daily life. Given several candidates (or selections), we must
choose one candidate (selection) by using a voting scheme or by matching the features
extracted from the entire area (or the nation). We call this G a global voting (or
matching) method. Many countries adopt this system in choosing a president, as in the
election of the Peruvian president. Now let us convert G to a new version r(G), where we
decide the winner of each of the pre-divided regions by G but make the final decision by
a simple majority of the number of winning regions, adopting the "winner-take-all"
principle within the pre-divided regions. The converted version r(G) is called a regional
voting (or



An extended abstract of the preliminary version of the paper has been presented at AIDA'99 
(International ICSC Symposium on Advances in Intelligent Data Analysis), Rochester, New York 
in June of 1999. 

Address: Computer Science Department, Utsunomiya University, Utsunomiya, Japan 321-8505; 
e-mail: lchen@alfin.mine.utsunomiya-u.ac.jp, tokuda@cc.utsunomiya-u.ac.jp. 



2 • L. Chen and N. Tokuda 



matching) method. The most typical regional voting method is the US presidential
election system. It is the robustness of the decision making by the G or r(G) processes
against a concentrated type of noise that we want to clarify in the present analysis.

Here, G could be any decision making procedure. In the simple voting system,
G makes use of voting and makes a decision by the majority of votes counted. In
the facial recognition problem, for example, G based on this voting system may
involve a pixel-by-pixel comparison between the two facial images to be compared,
or we could use as G any of the well-known dimension-reducing schemes such as
principal component analysis (PCA) [Jolliffe 1986] or a Gabor transform [Lee 1996],
the PCA scheme leading to the famous eigenface method [Turk and Pentland 1991].
The reduced facial space of PCA is spanned by the major eigenfaces of the training
set, for example; it is easy to find the coordinates of a new face projected into
the facial space, and then to make a recognition decision by matching the projection
with stored images of models. The corresponding r(G) could then be implemented by
first dividing the whole two-dimensional rectangular picture frame into smaller
regions (of equal size in our analysis). Within each region we make use of the PCA
method in decision making as G, and we make the final decision for the whole picture
in accordance with the simple majority principle in the number of winning regions,
where the winner gets all the votes of the region. It is the improved stability of
r(G) over that of G that has motivated our current research, as verified by several
numerical examples given in this paper.

The purpose of the present paper is to elucidate the basic mechanism by which the
regional matching method has an advantage in stability over the global matching
scheme against noise. The simplest voting model is selected for analysis, where each
cell in the nation has one vote; thus G itself can be regarded as a simple national
voting scheme where the winner is decided by a simple majority principle. To simplify
the analysis of the regional matching, we divide the nation into smaller regions of
equal size, where the winner is decided by the number of winning regions, the winner
of a region being decided by G. We set up a noise-and-voting model for this simple
situation and show that when the size of the regions is reasonably small, the regional
voting scheme is more stable than the national voting scheme. A conjecture is made
that this model is valid in a more general decision making process where G involves
a decision making process by PCA matching or Gabor matching. We present an
experimental verification to support the conjecture in the Appendix.

The present paper is organized as follows. In Section 2, we first give precise
definitions of noise, noise-concentrated area and the number of noise-contaminated
regions, including basic assumptions used in the analysis. We prove Theorem 2.1, which
relates the noise-concentrated area and the number of potential noise-contaminated
regions. Theorem 2.2 shows how we can improve the relation. The resulting Corollaries
2.1 and 2.2, corresponding to Theorems 2.1 and 2.2 respectively, give the lower
bounds of the noise level to a breakdown point of decision beyond which the decision
of the voting may overturn. In Section 3, we examine the results of Section 2
from various angles. An experimental verification of the theory is presented in
Section 4 using a black-and-white flag recognition problem, confirming the validity
of the theory on a pixel-by-pixel basis. An experimental verification given in the
Appendix supports the conjecture that the theory developed for the pixel-by-pixel
voting process remains valid for more general decision making processes involving
dimension-reducing schemes such as PCA or the Gabor transform.

2. THEOREMS 

2.1 Notations 

Important notations and basic assumptions used in the paper will be summarized 
here. 

We suppose that the nation (or the entire image for an image application) consists
of N unit cells (or pixels), each having one vote to exercise; for simplicity the nation
is always represented by a rectangle of size $l \times m$, so that $N = l \times m$. The
nation can be partitioned into K square, equal-sized regions of $m_r \times m_r$ cells
each. We also assume that both l and m are divisible by $m_r$ and that each pair of
opposing edges along the outer boundary of the rectangular nation is glued together, so
that a partition shifted past one edge wraps around to the other; the nation can then be
partitioned in a total of $m_r^2 = N/K$ different ways. Like $m_r$, which is the length
scale of a square region, $m_n$ denotes that of a noise-concentrated block, with the terms
"noise" and "noise-concentrated block" being defined in Definition 2.2 of the next
subsection.

2.2 Main Theorems 

We analyze in this paper a very simplified model allowing only two candidates in
the election, say, candidates A and B. Without loss of generality, we assume that, in
the absence of external sources of noise, A% of the total cells vote for A and B% of the
cells vote for B so that

A% + B% = 1 and A% > B%.

We discuss a possible extension to an n-candidate system in Section 3.2.1.

We examine the effect of "concentrated" noise on the decision making process of
election results by the global voting and the regional voting. In image applications,
such noise is often observed when the imagery contains transparency, specular
reflections, shadows, fragmented occlusion as seen through the branches of a tree or a
sun shade, and occlusion [Black and Anandan 1996].

The formal definition of concentrated noise as well as the formal definition of the 
global voting and regional voting is given below. 

Definition 2.1 (VOTING).

• National Voting - The entire population N of the nation votes either for Candidate
A or B, and Candidate A wins if and only if he gets a majority of the N votes.

• Regional Voting - The population (= N/K for K regions) of a region votes
for Candidate A or B, and a majority of votes determines the winner of the region;
a majority of the K winning regions, not the majority of the entire population
N of the nation, determines the winner for the nation.
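
The two rules of Definition 2.1 can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the list-of-lists representation of the nation, and the treatment of tied regions (counted for neither candidate) are assumptions made here, not part of the definition. The small example nation is deliberately unevenly distributed, unlike the situation covered by Assumption 2.3, so that the two rules can be seen to disagree.

```python
from typing import List

Grid = List[List[str]]  # each cell holds "A" or "B"


def national_vote(grid: Grid) -> str:
    """Simple majority over all cells; returns "A", "B" or "tie"."""
    a = sum(row.count("A") for row in grid)
    b = sum(row.count("B") for row in grid)
    return "A" if a > b else "B" if b > a else "tie"


def regional_vote(grid: Grid, mr: int) -> str:
    """Winner-take-all inside each mr x mr region, then a majority of regions."""
    rows, cols = len(grid), len(grid[0])
    if rows % mr or cols % mr:
        raise ValueError("mr must divide both side lengths of the nation")
    wins = {"A": 0, "B": 0, "tie": 0}
    for r0 in range(0, rows, mr):
        for c0 in range(0, cols, mr):
            region = [grid[r][c0:c0 + mr] for r in range(r0, r0 + mr)]
            wins[national_vote(region)] += 1  # G applied to a single region
    # Assumption of this sketch: tied regions count for neither candidate.
    return "A" if wins["A"] > wins["B"] else "B" if wins["B"] > wins["A"] else "tie"


# A 3 x 9 nation made of three 3 x 3 regions: A holds 17 of the 27 votes
# overall but wins only the leftmost region.
nation = [list("AAAAABAAB"), list("AAAABBABB"), list("AAAABBABB")]
print(national_vote(nation), regional_vote(nation, 3))
```

Applying the same majority rule to the whole nation and to each region mirrors the construction of r(G) from G described in the Introduction.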

Definition 2.2 (NOISE).

• We call a set of noise anti-A-noise (or anti-B-noise) if all the cells under its
influence will vote for B (or A) regardless of whether they originally vote for A or B.
The number of cells under influence is called the number of noise units.

• We call a vote noise-contaminated if the vote of a cell undergoes a change either
from candidate A to B or from candidate B to A under some change of environmental
conditions. A noise-contaminated vote undergoing a change from candidate A to B
(or B to A) is called an anti-A-noise-contaminated vote (or anti-B-noise-contaminated
vote).

• Anti-A-noise-concentrated blocks (anti-B-noise-concentrated blocks) are defined
as non-overlapping $m_n \times m_n$ sized areas in which all the cells are under the
influence of anti-A-noise (anti-B-noise).

• The anti-A-noise-concentrated (anti-B-noise-concentrated) area is defined as
the union of all anti-A-noise-concentrated (anti-B-noise-concentrated) blocks.

• A region is defined to be anti-A-noise-contaminated (anti-B-noise-contaminated)
if and only if the intersection of the region and the anti-A-noise-concentrated
(anti-B-noise-concentrated) area is not empty.

In the analysis, we assume that there is only anti-A-noise.

Assumption 2.1. The effects of anti-B-noise on election results will be ignored
in the analysis.

This assumption is justified for the following two reasons. Firstly, the anti-B-noise
and the anti-A-noise are independent, so that we may consider the effect
of the anti-A-noise entirely independently of the anti-B-noise. Secondly, we want
to establish a lower bound to a breakdown point in the prevailing situation of
A% > B%. We see that the anti-A-noise gives a lower bound in terms of the noise
level which we can accommodate before the results of the regional as well
as the global voting reverse or overturn. The result for the regional voting will be
established in Theorems 2.1 and 2.2 and Corollaries 2.1 and 2.2, while the exact
bound for the national voting is given in Observation 2.1.

As we have emphasized, we only consider locally "concentrated" noise. Thus we
have:

Assumption 2.2. All the anti-A-noise is within the anti-A-noise-concentrated area.

This assumption will always hold, because in the worst case we can regard the smallest
block size of a single cell as the size of a noise-concentrated block. For the general
case of blocks of $m_n \times m_n$ cells, excluding blocks consisting of a single cell, it
is not difficult to conclude that a possible error between the anti-A-noise of the nation
and the anti-A-noise-concentrated area becomes negligibly small as the size of each of the
noise-influenced areas increases sufficiently. Thus this assumption will not affect the
validity of any of the following theorems. We will clarify the situation in Observation
3.2 of Section 3.2.3.

The following assumption is made in terms of the definitions we introduce. 

Assumption 2.3. 

• Average Distribution Assumption - We assume that in the absence of noise,
the voting distribution of the undisturbed national voting prevails in any sufficiently
large area, whether consisting of a continuous part of the nation or of randomly
chosen blocks of cells.

• Region Size - We assume that the size of the equally partitioned regions is
sufficiently large so that, in the absence of noise, the average distribution assumption
above holds.

The assumption implies that, in the absence of noise, the global voting behavior
of A% and B% prevails in each of the regions, such that there are almost A%(N/K)
cells voting for A and B%(N/K) cells voting for B. This assumption can be relaxed
(see Section 3.2.5).

We conclude that, if Candidate A (or B) wins in the nation, so does Candidate 
A (or B) in each of the regions. 

Observation 2.1. If there exist more than $\frac{A\% - B\%}{2} \times N$
anti-A-noise-contaminated votes, that is, if more than $\frac{A\% - B\%}{2A\%}$ of the
original votes cast for A change to B, then the noise is effective in reversing the
candidate selection from A to B in the national voting. We say the national voting can
accommodate $\frac{A\% - B\%}{2} \times N$ noise before a reversal of the original voting
result takes place.

Definition 2.3. We call a region anti-A-noise-contaminated if and only if the
intersection of the region and the anti-A-noise-concentrated area is not empty.

The following lemma shows that we can construct a partitioning of the nation
such that the noise-concentrated blocks are concentrated into some fraction of all
the regions. It gives a clue as to why the regional voting is capable of accommodating a
higher noise level than the averaged national voting: only a fraction of the
regions absorb the dominant effects of the superimposed noise.

Lemma 2.1. For any given small positive integer s, we can always choose a
partition of the rectangular nation into K regions such that the anti-A-noise is
concentrated among K' of the K regions (K' < K), so that the total size of
these K' regions is less than that of the anti-A-noise-concentrated area plus s units.

Proof. The lemma can be proved directly by considering the worst case: namely,
we can always divide the rectangle, in the worst case, into $K = l \times m$ regions of
unit size, which means the above difference always vanishes. □

Note that, depending on the value of s and the distribution of the anti-A-noise,
we do not always have to divide the nation into regions of unit size; a coarser
partition and noise-concentrated block size can be found which fulfill the conclusion
of the lemma.

The following theorems show the relations between the size of the anti-A-noise-concentrated
area and the total size of the anti-A-noise-contaminated regions in the worst case, which
lead to the lower bounds to a breakdown point of decision.

Theorem 2.1. Let $S_c$ be the size of the anti-A-noise-concentrated area and $S_r$ be
the total size of the anti-A-noise-contaminated regions of K-partitioned regional voting.
We then have:

(1) $\frac{S_r}{S_c} \le \left( \left\lceil \frac{m_n}{m_r} \right\rceil + 1 \right)^2 \times \frac{m_r^2}{m_n^2}$.

(2) $S_c < \frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times 50\%\,N$
is a sufficient condition for the regional voting to retain the original candidate
selection of A.

Proof. Item 1 of the theorem follows immediately for $m_r > m_n$ because, for
any partitioning, each anti-A-noise-concentrated block of size $m_n \times m_n$ can
"contaminate" at most $(\lceil m_n/m_r \rceil + 1)^2$ equal-sized regions of
$m_r \times m_r$. It is a simple matter to confirm that the conclusion is valid for
$m_n > m_r$ as well.

Item 2 of the theorem comes from the fact that fewer than 50% of the K regions can
be contaminated when
$S_c < \frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times 50\%\,N$. □
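
The counting argument behind item 1 can be checked by brute force. The sketch below (function names are hypothetical) slides a block of side $m_n$ over every offset relative to a grid of regions of side $m_r$, counts the regions it meets, and compares the worst case with $(\lceil m_n/m_r \rceil + 1)^2$. It is a numerical sanity check over small sizes, not a substitute for the proof.

```python
def ceil_div(a: int, b: int) -> int:
    """Integer ceiling of a / b for positive integers."""
    return -(-a // b)


def regions_met_1d(offset: int, mn: int, mr: int) -> int:
    """Regions met along one axis by a length-mn block starting at `offset`."""
    first = offset // mr
    last = (offset + mn - 1) // mr
    return last - first + 1


def worst_case_regions(mn: int, mr: int) -> int:
    """Largest number of mr x mr regions met by one mn x mn block."""
    worst_1d = max(regions_met_1d(o, mn, mr) for o in range(mr))
    return worst_1d ** 2  # the two axes are independent


def theorem_bound(mn: int, mr: int) -> int:
    """Per-block bound used in item 1 of Theorem 2.1."""
    return (ceil_div(mn, mr) + 1) ** 2


for mn, mr in [(5, 8), (6, 3), (3, 3)]:
    print(mn, mr, worst_case_regions(mn, mr), theorem_bound(mn, mr))
```

Multiplying the per-block count by the region size $m_r^2$ and by the number of blocks $S_c/m_n^2$ gives the bound on $S_r$ in item 1.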

We immediately have the following Corollary.

Corollary 2.1. The original candidate selection of A can accommodate at least
$\frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times (A\%/2)N$
anti-A-noise (i.e. $\frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times 50\%$
of the cells voting for A) before the candidate selection is reversed.

Theorem 2.1 and Corollary 2.1 show clearly that, to retain the original candidate
selection of A in the regional voting, a finer subdivision of the nation, namely into
regions of smaller size, leads to a higher stability, provided that Assumption 2.3 on
region size remains valid.

2.3 Further Improvement by Shifting Strategy 

The bounds of Theorem 2.1 and Corollary 2.1 can be further improved by exploiting
the Shifting Strategy of [Hochbaum and Maass 1985]. We first define a Shifting
Strategy for some operation, called Action A (not to be confused with Candidate A),
with respect to a square region embedded within a rectangular nation.

Shifting Strategy for Certain Action A
Consider the partitioning of a rectangular nation into $m_r \times m_r$ square regions,
where $m_r$ is some arbitrary integer.
Repeat step 1 to step 2 $m_r$ times:

(1) Move all the vertical partition lines to the right by one cell; repeat step 2
$m_r$ times;

(2) Move all the horizontal lines up by one cell; execute Action A.

The shifting strategy enumerates all the possible different partitionings of the
nation. Now, by replacing Action A of the strategy with the regional voting subject
to the same noise environment, we show by Theorem 2.2 below how we can improve
Theorem 2.1.
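
The enumeration performed by the shifting strategy can be sketched as follows. In this hypothetical sketch, "Action A" is taken to be counting the noise-contaminated regions for one shifted partition; noise blocks are given by their top-left cells, and indices wrap around to model the glued edges assumed in Section 2.1. The function names and the example sizes are assumptions chosen for illustration.

```python
from itertools import product


def contaminated_regions(blocks, mn, mr, rows, cols, dr, dc):
    """Regions met by the noise blocks when the partition lines are shifted by (dr, dc)."""
    hit = set()
    for r0, c0 in blocks:
        for r, c in product(range(r0, r0 + mn), range(c0, c0 + mn)):
            # Wrap around: opposite edges of the nation are glued together.
            hit.add((((r - dr) % rows) // mr, ((c - dc) % cols) // mr))
    return hit


def shifting_strategy(blocks, mn, mr, rows, cols):
    """Try all mr * mr shifted partitions; return the best shift and all counts."""
    counts = {
        (dr, dc): len(contaminated_regions(blocks, mn, mr, rows, cols, dr, dc))
        for dr, dc in product(range(mr), repeat=2)
    }
    best = min(counts, key=counts.get)
    return best, counts


# One 6 x 6 noise block at cell (1, 1) of a 12 x 12 nation with 3 x 3 regions.
best, counts = shifting_strategy([(1, 1)], mn=6, mr=3, rows=12, cols=12)
print(best, counts[best], counts[(0, 0)])
```

In this example the unshifted partition lets the block touch nine regions, while the partition shifted by one cell in each direction confines it to four.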

Theorem 2.2. Under the assumptions of Theorem 2.1, the shifting strategy
ensures that there exists at least one partition satisfying the following properties:

(1) $\frac{S_r}{S_c} \le \left( \frac{m_n + m_r - 1}{m_n} \right)^2$.

(2) The sufficient condition for the regional voting to retain the original
candidate selection of A can be improved to:
$S_c < \left( \frac{m_n}{m_n + m_r - 1} \right)^2 \times 50\%\,N$.

Proof. To prove item 1, we must show that, among all the possible $m_r^2$ different
partitions that the Shifting Strategy can generate, each $m_n \times m_n$
anti-A-noise-concentrated block contaminates a total of $(m_n + m_r - 1)^2$ regions,
counted over all the partitions. Once this is done, the Pigeonhole Principle [Aho and
Ullman 1992] ensures that there is at least one partition in which the existing
$S_c/m_n^2$ anti-A-noise-concentrated blocks contaminate at most
$(S_c/m_n^2)(m_n + m_r - 1)^2/m_r^2$ regions.

We prove this for $m_r > m_n$ first. Among all possible $m_r^2$ partitions, $(m_n - 1)^2$
partitions divide the block into 4 regions, $2(m_n - 1) + 2(m_n - 1)(m_r - m_n)$ of them
divide the block into 2 regions, while $(m_r - m_n + 1)^2$ of the partitions cannot divide
the block into more than one region. Summing them up, the noise-concentrated
block is divided into $(m_n + m_r - 1)^2$ different regions. Figure 1 illustrates the three
cases above for $m_n = 5$ and $m_r = 8$, for example. We see that the partitionings
through the solid dots of the figure divide the noise block into 4 regions, those
through the crossed points into 2 regions, while those through the hollow dots are
not able to divide the block.

Fig. 1. Different Partitioning in the Nation

For $m_r < m_n$, we enumerate each of the $m_r \times m_r$ possible partitions. Writing
$c_0 = \lceil m_n/m_r \rceil$ and $c_i = \lceil (m_n - i)/m_r \rceil + 1$ for
$i = 1, \ldots, m_r - 1$, we know that the block would be divided into

$c_0 \cdot c_0,\ c_0 \cdot c_1,\ \ldots,\ c_0 \cdot c_{m_r - 1};\quad c_1 \cdot c_0,\ c_1 \cdot c_1,\ \ldots,\ c_1 \cdot c_{m_r - 1};\quad \ldots;\quad c_{m_r - 1} \cdot c_0,\ \ldots,\ c_{m_r - 1} \cdot c_{m_r - 1}$

regions separately. Summing up all the possible terms above by means of the formula

$\sum_{i=0}^{m_r - 1} \left\lceil \frac{m_n - i}{m_r} \right\rceil = m_n,$

we know that the block will be divided into $(m_n + m_r - 1)^2$ different regions for all
the $m_r \times m_r$ different partitions.

Item 2 follows from item 1, since the condition that the number of noise-contaminated
regions be less than 50% of the total number of regions constitutes a sufficient
condition for retaining the original candidate selection of A. □
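
The case analysis in the proof can be verified numerically. The following sketch (hypothetical function names) tallies, over all $m_r^2$ shifted partitions, how many partitions cut a single $m_n \times m_n$ block into a given number of regions, and checks that the pieces add up to $(m_n + m_r - 1)^2$ whether $m_r$ is larger or smaller than $m_n$. The block position is measured here as an offset into a region, which is one convenient convention assumed for the sketch.

```python
from collections import Counter
from itertools import product


def pieces_1d(offset: int, mn: int, mr: int) -> int:
    """Regions met along one axis by a length-mn block starting `offset` cells into a region."""
    return (offset + mn - 1) // mr + 1


def tally(mn: int, mr: int) -> Counter:
    """Map 'number of regions the block is cut into' to the number of partitions doing so."""
    return Counter(pieces_1d(i, mn, mr) * pieces_1d(j, mn, mr)
                   for i, j in product(range(mr), repeat=2))


def total_pieces(mn: int, mr: int) -> int:
    """Regions contaminated by one block, summed over all mr * mr partitions."""
    return sum(k * count for k, count in tally(mn, mr).items())


# The setting of Figure 1: m_n = 5, m_r = 8.
print(dict(tally(5, 8)), total_pieces(5, 8))
```

For the setting of Figure 1 the tally splits the 64 partitions into 16, 32 and 16, matching the three counts $(m_n - 1)^2$, $2(m_n - 1) + 2(m_n - 1)(m_r - m_n)$ and $(m_r - m_n + 1)^2$ used in the proof.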

Corollary 2.2. For a fixed size of noise-concentrated block and a fixed size of
equally partitioned region, we can find at least one partition which can accommodate
at least $\left( \frac{m_n}{m_n + m_r - 1} \right)^2 \times (A\%/2)N$
anti-A-noise-contaminated votes (that is,
$\left( \frac{m_n}{m_n + m_r - 1} \right)^2 \times 50\%$ of all the votes originally
given to A) before the result of candidate selection is reversed.

CONJECTURE: Theorem 2.1 and Theorem 2.2 and the related Corollaries remain
valid for a more general G, including feature matching by PCA.
We confirm this conjecture by means of experimental verifications in the Appendix.

3. CONCLUSION AND DISCUSSION 

The detailed analysis of the regional and national voting shows that
the regional voting is the more stable and robust of the two. This is in agreement
with our physical intuition, as supported by Lemma 2.1, that only a small fraction
of the regions absorb the heavily concentrated effects of noise. We give
some concrete numerical examples below where possible.

3.1 Conclusion 

3.1.1 Robustness of Regional Voting. Table 1 is computed from the formulas of
Corollary 2.1 and Observation 2.1 of Section 2.2 for several values of (A% - B%).

Table 1. Stability Margins of Regional Voting and National Voting for N = 10000:
the number of anti-A-noise-contaminated votes which regional and national
voting can accommodate before the decision reverses.

A% - B%      Regional Voting                   National Voting
             m_n/m_r = 1     m_n/m_r = 2
  5%             656            1167                 250
 10%             688            1222                 500
 20%             750            1333                1000


We see that the regional voting is always more stable and robust than the national
voting when A% - B% is not too large, say 5% and 10%, and that this robustness increases
as $m_n/m_r$ increases, that is, as the partitioned region size becomes smaller.
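
The entries of Table 1 can be recomputed from Observation 2.1 and Corollary 2.1. The sketch below uses exact rational arithmetic and rounds half up; the rounding rule and the function names are assumptions of this sketch, and $A\%$ is recovered from $A\% + B\% = 1$ together with the stated difference $A\% - B\%$.

```python
from fractions import Fraction
from math import floor

N = 10000


def ceil_div(a: int, b: int) -> int:
    """Integer ceiling of a / b for positive integers."""
    return -(-a // b)


def national_margin(diff: Fraction, n: int) -> Fraction:
    """Observation 2.1: (A% - B%)/2 x N."""
    return diff / 2 * n


def regional_margin(diff: Fraction, n: int, mn: int, mr: int) -> Fraction:
    """Corollary 2.1: (mn/mr)^2 x 1/(ceil(mn/mr) + 1)^2 x (A%/2) x N."""
    a = (1 + diff) / 2  # from A% + B% = 1 and A% - B% = diff
    return Fraction(mn, mr) ** 2 / (ceil_div(mn, mr) + 1) ** 2 * (a / 2) * n


def round_half_up(x: Fraction) -> int:
    return floor(x + Fraction(1, 2))


table = []
for percent in (5, 10, 20):
    diff = Fraction(percent, 100)
    table.append((percent,
                  round_half_up(regional_margin(diff, N, 1, 1)),  # m_n/m_r = 1
                  round_half_up(regional_margin(diff, N, 2, 1)),  # m_n/m_r = 2
                  round_half_up(national_margin(diff, N))))
    print(table[-1])
```

Only the ratio $m_n/m_r$ enters the regional bound when $m_r$ divides $m_n$, which is why the sketch can use $m_r = 1$ for both columns.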

For larger values of A% - B%, say 20%, at $m_n/m_r = 1$, the lower bound of
regional voting given by Corollary 2.1 is smaller than the exact bound of national
voting. We believe that the regional voting can still be more stable than the
national voting, because in counting the number of possible "losing" regions in
Theorem 2.1, we have counted all the noise-contaminated regions,
including those which still have a margin to the breakdown point and thus remain
pro-A regions. In fact, we have excluded only those regions entirely clean or free of
any noise.

3.1.2 Improvement by Shifting Strategy. Given some distributed noise-concentrated 
area as the union of all noise-concentrated blocks, Theorem 2.2 and Corollary 2.2 
provide a method of improving the stability of the regional matching as shown in 
Table 2. A larger improvement is evident for smaller A% — B%. 

3.1.3 A Tradeoff on Region Size. By examining the theorems, we see that there
must be a tradeoff for the size $m_r$ of the partitioned regions; a smaller size of
partitioned region increases the robustness of the regional voting, but Assumption
2.3 requires the size not to be too small. This is remarkably well borne out experimentally
in Figure 6 where, at low as well as at high noise levels, the recognition rate falls off
if the entire image of 80 x 120 pixels is divided into more than 384 regions (a region
size corresponding to 5 x 5 pixels), to which we come back later on.

3.2 Discussion 

3.2.1 What if there are more than two candidates? If the number of candidates
exceeds two, the number of possible decision making processes increases. For example, we
may allow each region to select the top two or more candidates at a time, then make
the candidate selection based on the summed results of all the regions. Here, we
set up one simple model where the basic decision making principle adopted in the
two-candidate system is retained. Each of the regions selects only one candidate
according to a simple majority principle, and then the regional voting selects the
candidate who wins a majority of the winning regions. Suppose there are candidates
A, B, C, ..., and A% > B% > C% > .... The anti-A-noise is defined to convert the
votes originally for A to B and keep other votes unchanged. We have the following
theorem by exactly the same proof as that of Theorem 2.1.

Theorem 3.1.

1. The national voting can only accommodate at most $(A\% - B\%)/2 \cdot N$
anti-A-noise-contaminated votes, i.e. $(A\% - B\%)/(2A\%)$ of all the votes originally
for A.

2. Regional voting can accommodate at least
$\frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times (A\%/2)N$
anti-A-noise-contaminated votes, i.e.
$\frac{m_n^2}{m_r^2} \times \frac{1}{(\lceil m_n/m_r \rceil + 1)^2} \times 50\%$
of all the votes originally for A.

We have exactly the same conclusion as in the two-candidate system, confirming
that the regional voting still accommodates a higher level of noise when A% and
B% are very close.

3.2.2 Effect of salt-and-pepper noise. In sharp contrast to the localized and thus
concentrated noise we have assumed in the present paper, we examine the effects
of impulse-type salt-and-pepper noise [Pratt 1991].

Definition 3.1. The white noise is a set of noise "uniformly" distributed over
the nation in such a way that in each reasonably large area, whether composed
of a continuous part of the nation or of randomly chosen blocks of cells, we
have the same percentage of noise.

Consider only the impulse-type, well-dispersed anti-A-noise. A conclusion about
the noise bound similar to Observation 2.1 can be obtained:

Observation 3.1. The global voting and regional voting can accommodate the
same percentage of salt-and-pepper noise.

The observation can easily be proved by noting that, in each region or in the whole
nation, the candidate selection will not reverse as long as there is less than
$\frac{1}{2}(A - B)\%$ of salt-and-pepper noise.

The observation shows that as long as the partitioned region is large enough to
allow the original distribution of the entire nation to prevail, we expect no difference
between the two decision systems in the presence of salt-and-pepper noise. It is
when the uniform distribution assumption fails between the national voting and
the regional voting, as we have assumed in the paper, that the difference matters.

3.2.3 Size of noise-concentrated blocks. The generality of our analysis depends
heavily on the fact that a possible error between the total size of all noise
generated and the union of all non-overlapping noise-concentrated blocks is kept
negligibly small. Let us cut some $m_n \times m_n$ blocks out of a reasonably
concentrated noise-contaminated area. It is reasonable to assume that $m_n \times m_n$
is small compared to the size of any continuous part of the noise-contaminated area. We
expect that the measure of what is left after cutting out the blocks is quite small
compared with the total number of noise units. We demonstrate this formally below by
showing why our theorems and corollaries should remain valid.

Definition 3.2. A line segment is called an orthodiameter of a continuous area
of the nation if and only if

(1) all the points of the segment lie within the area;

(2) only the two end points of the line segment lie on the boundary of the area;

(3) the line segment is parallel to the horizontal lines or the vertical lines
comprising the boundary of the nation.

Definition 3.3. The orthomeasure of an area (continuous or detached) of the
nation is defined as the length of the shortest orthodiameter of any continuous part
of the area.

Observation 3.2. Let us cut out as many $m_n \times m_n$ blocks as possible and let
$S_c$ be the total size of all these blocks (i.e. the size of the noise-concentrated area
discussed in Definition 2.2). Let OM be the orthomeasure of the set of
noise-influenced votes of the nation, and $N_n$ be the number of noise units (i.e. the
number of noise-affected cells). We have:

$\lim_{OM/m_n \to \infty} \frac{N_n - S_c}{N_n} = \lim_{OM/m_n \to \infty} \frac{N_n - S_c}{S_c} = 0.$

Note that the above observation includes the situation that $N_n - S_c = 0$ when
$m_n = 1$.

3.2.4 Are all of the anti-A-noise-concentrated blocks effective in reversing the
votes? Suppose that in a noise-concentrated area, only a fraction r of the votes for A
undergoes a change to B, where r is some constant within [0, 1] and close to 1. All the
results in Theorems 2.1-2.2 remain the same, while those of Corollaries 2.1-2.2 may
be divided by r. We believe that the conclusion of Section 3.1 still holds because in
all the corollaries we always use conservative lower bounds, having discarded every
region contaminated by even a single noise unit.

3.2.5 Relaxing the Average Distribution Assumption. In view of the proofs of
Theorems 2.1-2.2 given in Section 2, the restriction on region size in Assumption
2.3 of Section 2.2 is too strict and can be relaxed as follows.

Observation 3.3. All the conclusions of Theorems 2.1 and 2.2 and Corollaries
2.1 and 2.2 still hold, as long as we can choose the size of the partitioned regions large
enough that the voting distributions in the absence of noise for Candidates A
and B in each of the regions satisfy

A% > B%.



Robustness of Regional Matching Scheme over Global Matching Scheme • 11 

We emphasize that the voting distributions in the regions do not have to follow 
that of the nation in the absence of noise. 

4. EXPERIMENT: WHITE OR BLACK DOMINATED FLAGS 

The first example relates to a white-black mixed flag which we want to recognize 
either as a white- or a black-dominated flag (see figure 2 for illustration, where each 
cell in the figure denotes the smallest unit, a "pixel"). Unlike the second example 
to follow, this example applies the present theory directly on a pixel-by-pixel basis 
without resorting to a feature-extracting transformation such as the Turk and Pentland 
method. The size of the "nation" is 15 x 24 cells, and each partitioned region is 
a 3 x 3 or a 4 x 5 block, which is neither too large nor too small relative to the size 
of the nation. Suppose that a white-dominated flag is given as in figure 2-(1). This 
is confirmed easily by both global and regional voting: in the global voting, 
"White" gets 207 votes while "Black" gets 153 votes; in regional vote counting based 
on a 4 x 5 regional partitioning, "White" wins in 12 regions while "Black" wins in 
4 regions, and in the other 2 regions "White" and "Black" get the same number of 
votes. If we further divide the nation into 3 x 3 regions, "White" wins in 28 regions 
while "Black" wins in 12 regions. Now we choose 7 pixels of figure 2-(1) at random 
and introduce anti-white-noise blocks of 5 x 5 cells at these locations, so that 
every "White" pixel within a block is changed to a "Black" pixel with probability 
0.7. As a result, 35 "White" pixels are changed to "Black" (that is, they are 
anti-white-noise contaminated), transforming figure 2-(1) into figure 2-(2). By 
counting, we see that after the noise is added, the global voting reverses the result 
of the candidate selection from "White" to "Black" dominated, because this time 
"Black" gets 188 votes while "White" gets only 172 votes. By regional voting with 
regions of 4 x 5 cells, however, the original selection of "White" dominated remains 
valid, because this time "White" wins in 10 regions while "Black" wins in 6, and in 
the other 2 regions "White" and "Black" get the same number of votes. If we further 
divide the nation into 3 x 3 regions, "White" wins in 25 regions while "Black" wins 
in 15 regions, which increases the stability margin and thus confirms our theory. 
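The vote counting used in this example can be sketched in a few lines of Python. The sketch below is illustrative only: it uses a small hypothetical 6 x 9 flag rather than the 15 x 24 flag of figure 2 (whose pixel data are not reproduced here), encodes a "White" cell as 1 and a "Black" cell as 0, and the function names are assumptions introduced for the illustration.

```python
def global_vote(grid):
    """Count White (1) and Black (0) cells over the whole nation."""
    white = sum(sum(row) for row in grid)
    total = sum(len(row) for row in grid)
    return white, total - white


def regional_vote(grid, region_h, region_w):
    """Return (regions won by White, regions won by Black, tied regions)."""
    rows, cols = len(grid), len(grid[0])
    white_wins = black_wins = ties = 0
    for top in range(0, rows, region_h):
        for left in range(0, cols, region_w):
            cells = [grid[r][c]
                     for r in range(top, min(top + region_h, rows))
                     for c in range(left, min(left + region_w, cols))]
            white = sum(cells)
            black = len(cells) - white
            if white > black:
                white_wins += 1
            elif black > white:
                black_wins += 1
            else:
                ties += 1
    return white_wins, black_wins, ties


# Hypothetical toy flag: every 3 x 3 block holds 5 White and 4 Black cells.
BLOCK = [[1, 0, 1], [0, 1, 0], [1, 0, 1]]
clean = [[BLOCK[r % 3][c % 3] for c in range(9)] for r in range(6)]

# Concentrated noise: one 3 x 3 block is turned entirely Black.
noisy = [row[:] for row in clean]
for r in range(3):
    for c in range(3):
        noisy[r][c] = 0

print(global_vote(clean), regional_vote(clean, 3, 3))
print(global_vote(noisy), regional_vote(noisy, 3, 3))
```

On this toy flag the noise block reverses the global result from "White" (30 votes to 24) to "Black" (25 votes to 29), while "White" still wins 5 of the 6 regions, which is the same qualitative behavior as in figure 2.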




Fig. 2. "White" or "Black" Dominated? 

Appendix: Facial recognition 

We verify the conjecture in a practical image processing application by carrying 
out a set of facial recognition experiments subject to localized, concentrated noise. 
We have used the images of 16 people. As shown in figure 3-(2) and -(3), we introduce 
circular blocks of noise generated with Photoshop version 4.0 randomly into the test 
images at a low and a high noise level of 25% and 50% respectively. The circular 
blocks of noise generated by Photoshop have a density variation within the area 
ranging from 0 to 255. We have defined an area to be noise-affected if the density 
levels of the original image and those of the noise-affected image differ by 64 
(i.e. 25% x 256) in pixel density. The noise level in each of the images is shown in 
Table 3. Each picture is of size 80 x 120 = 9600 pixels; for each noise level, the 
table gives the number of noise-affected pixels and the corresponding percentage of 
the image. 

(1) Original Image   (2) Lower Noise Level   (3) Higher Noise Level 

Fig. 3. Typical Noise-Contaminated Images at Low and High Noise Levels 
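The noise-level measurement described above can be sketched as follows. This is a minimal illustration under a stated assumption: a pixel is counted as noise-affected when its density differs from the original by at least 64 levels (the text says only that the densities "differ by 64"). The function name and the tiny sample images are hypothetical.

```python
def noise_level(original, noisy, threshold=64):
    """Return (number of noise-affected pixels, percentage of the image).

    Assumption: a pixel is noise-affected when its density differs from
    the original by at least `threshold` levels (64 = 25% x 256).
    """
    affected = sum(
        1
        for row_o, row_n in zip(original, noisy)
        for o, n in zip(row_o, row_n)
        if abs(o - n) >= threshold
    )
    total = sum(len(row) for row in original)
    return affected, 100.0 * affected / total


# Hypothetical 2 x 2 image pair with densities in the range 0..255.
original = [[0, 100], [200, 255]]
noisy = [[64, 100], [100, 200]]
print(noise_level(original, noisy))
```

As a consistency check on Table 3, 2757 noise-affected pixels out of 9600 correspond to 28.7%, which is the value listed for image No.0 at the lower noise level.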

Turk and Pentland's eigenvector algorithm [Turk and Pentland 1991] is used for 
feature extraction or data compression, where the eigenvector transformation 
operates uniformly not only over the discrete pixels of the whole nation but also 
over the pixels of each partitioned region. Since the effects of random noise 
remain random on the transformed planes without being magnified or filtered, we 
assume that the effects of noise blocks remain transparent to the transformation, 
implying that the conjecture remains valid. For convenience, we now adopt a 
new partition notation, dividing the nation into 1 (namely global), 8, 16, 32, 64, ... 
regions, so that the division into 1 region corresponds to the original national 
matching. 
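The regional matching scheme can be illustrated by the following sketch, in which each region votes for its nearest training image and the candidate winning the most regions is selected. Note the simplification: to keep the example self-contained, the distance is computed directly on raw pixel densities rather than on the eigenvector features of Turk and Pentland used in the experiments. The function names and the 4 x 4 toy images are hypothetical.

```python
from collections import Counter


def region_distance(a, b, top, left, h, w):
    """Sum of squared pixel differences between a and b inside one region."""
    return sum((a[r][c] - b[r][c]) ** 2
               for r in range(top, top + h)
               for c in range(left, left + w))


def regional_match(test, gallery, h, w):
    """Each h x w region votes for its nearest gallery image.

    Returns (winning label, fraction of regions won by that label).
    Simplification: raw pixel distance stands in for eigenvector features.
    """
    rows, cols = len(test), len(test[0])
    votes = Counter()
    for top in range(0, rows - h + 1, h):
        for left in range(0, cols - w + 1, w):
            best = min(gallery, key=lambda label: region_distance(
                test, gallery[label], top, left, h, w))
            votes[best] += 1
    winner, count = votes.most_common(1)[0]
    return winner, count / sum(votes.values())


def global_match(test, gallery):
    """National (global) matching: the whole image is a single region."""
    return regional_match(test, gallery, len(test), len(test[0]))[0]


# Hypothetical gallery of two flat 4 x 4 images.
gallery = {
    "A": [[100] * 4 for _ in range(4)],
    "B": [[140] * 4 for _ in range(4)],
}

# Test image: candidate A with one 2 x 2 block of concentrated noise.
test = [row[:] for row in gallery["A"]]
for r in range(2):
    for c in range(2):
        test[r][c] = 255

print(global_match(test, gallery))
print(regional_match(test, gallery, 2, 2))
```

In this toy case the noise block dominates the global distance, so the global matching selects "B", whereas three of the four 2 x 2 regions remain clean and the regional vote still selects "A".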

Recognition rates of the experiments are compared in figures 4, 5 and 6, which 
illustrate the results of 1-region up to 16-, 32-, 64-, ... regional matching at both 
low and high noise levels. Figures 4 and 5 show, for the lower and higher noise 
levels respectively, the percentages of the regions which the correctly recognized 
faces have won. Note, however, that the percentages of the maximum vote obtained 
differ considerably among the candidates, depending on how the votes are distributed 
among them; a high percentage of correctly recognized regions does not imply directly 
that the corresponding face is correctly recognized in the regional matching. 
Examining the data of figures 4 and 5, we may make the general statement that the 
percentages of the winning regions for a correctly recognized candidate are almost 
always higher as the number of partitions increases. Exceptions are shown in figure 5 
when we divide the nation into too many regions. This behavior is confirmed by 
figure 6, where the recognition rates start to fall off as the number of regions 
increases beyond 384, which corresponds to a region size of 5 x 5 pixels for the 
entire image size of 80 x 120 pixels. Obviously, when the region size is less than 
5 x 5 pixels, not only is Assumption 2.3 invalidated but the eigenvector method may 
also fail to give a meaningful result. Figure 6 shows clearly that the regional 
matching gives a recognition rate at least as good as that of the national matching, 
and that the smaller the regions we choose, the higher the recognition rate of the 
regional matching. When the regions are too small, say, smaller than the 5 x 5 
pixels of the 384-region partition, as with the 3 x 2 pixels of the 1600-region 
partition in this example, the recognition rate deteriorates. This convincingly 
supports the soundness of our theorems, and implies also the soundness of the 
Average Distribution Assumption in Section 2 and the validity of the relaxing 
conditions on the assumption in Section 3.2.5. 
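The relation between the number of regions and the region size quoted above is simple arithmetic on the 80 x 120 image. The short sketch below (with a hypothetical helper name) reproduces it for the partition counts used in the experiments.

```python
IMAGE_WIDTH, IMAGE_HEIGHT = 80, 120  # image size used in the experiments


def pixels_per_region(num_regions):
    """Pixels in each region when the image is split into equal regions."""
    total = IMAGE_WIDTH * IMAGE_HEIGHT
    if total % num_regions != 0:
        raise ValueError("image does not divide evenly into that many regions")
    return total // num_regions


for n in (1, 8, 16, 32, 64, 384, 1600):
    print(n, pixels_per_region(n))
```

The 384-region partition gives 25 pixels per region (5 x 5), and the 1600-region partition gives only 6 pixels per region (3 x 2), which is where the recognition rate is reported to deteriorate.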

[Figure 4 plot, titled "Lower Noise Level": percentage of correctly recognized regions against Image Index] 

Fig. 4. The Rates of Correctly Recognized Regions in Each Regional Matching Scheme for Lower 
Noise Level Images 



The superiority of the regional matching is evident for images at the lower noise 
level. With 16-region matching, 10 images out of 16 candidates are recognized 
correctly, and with 384-region partitioning all 16 images are recognized correctly, 
whereas Turk and Pentland's [Turk and Pentland 1991] eigentemplate matching method 
on the whole image can recognize only 5 images out of 16 (figure 6).¹ At the higher 
noise level, 16-region matching recognizes only 1 image while 384-region matching 
recognizes 6 (figure 6). This should be compared with the single correct recognition 
of the global matching. Increasing the number of regions does not necessarily improve 
the results further: the numerical results for 1600 regions, which recognize only two 
images, confirm this fact. This is a tradeoff problem on the size of the partitioned 
regions, as discussed in section 3.1.3. 

Table 2. Improved Regional Voting by Shifting Strategy Calculated for N = 10000 

                 m_n = 3, m_r = 3                  m_n = 4, m_r = 2
A% - B%    Corollary 2.1   Corollary 2.2     Corollary 2.1   Corollary 2.2
  5%            656             945              1167            1680
 10%            688             990              1222            1760
 15%            719            1035              1278            1840
 20%            750            1080              1333            1920

Table 3. The Noise of Each Image 
For each noise level, the first row gives the number of noise-affected pixels while 
the second row gives the level of noise in percent. 

Noise-affected           Training face images of individuals (80 x 120 size)
face images            No.0   No.1   No.2   No.3   No.4   No.5   No.6   No.7
Lower level    No.     2757   2450   2429   1871   2004   1729   1959   2743
               %       28.7   25.5   25.3   19.5   20.9   18.0   20.4   28.6
Higher level   No.     5748   5474   5701   4372   4909   5051   5359   4464
               %       59.9   57.0   59.4   45.5   51.1   52.6   55.8   46.5

face images            No.8   No.9  No.10  No.11  No.12  No.13  No.14  No.15
Lower level    No.     2502   2379   2043   2437   1566   2895   2517   1946
               %       26.1   24.8   21.3   25.4   16.3   30.2   26.2   20.3
Higher level   No.     4907   5073   4755   4900   4762   5377   5711   4811
               %       51.1   52.8   49.5   51.0   49.6   56.0   59.5   50.1

[Figure 5 plot, titled "Higher Noise Level": vertical axis from 10% to 50%; legend: 16-regional matching, 384-regional matching, 1600-regional matching] 

Fig. 5. The Rates of Correctly Recognized Regions in Each Regional Matching Scheme for Higher 
Noise Level Images 

[Figure 6 plot: horizontal axis "Number of Regions" with ticks 1, 8, 16, 32, 64, 384, 1600] 

Fig. 6. Recognition Rates of Each Regional Matching Scheme and National Matching 

The first motivation for this work arose from exactly the same situation in facial 
recognition problems by Gabor wavelet analysis [Huang et al. 1999], where the 
matching is carried out in 8 x 8 Gabor window regions (equivalent to 8 x 8 
partitioned regions in the present paper). We were able to identify the faces with 
100% accuracy using the images of 16 people under three different lighting 
conditions: head-on lighting, 45 degree lighting and 90 degree lighting. In 
contrast, by the global vote counting method, where Turk and Pentland's original 
eigenface algorithm [Turk and Pentland 1991] is used, we were able to correctly 
identify the faces with only 87.5% accuracy. However, the comparison given there is 
not decisive in determining the superiority of the regional matching over the global 
matching, because the regional matching involved the Gabor transform as well as the 
eigenvector decomposition, whereas the global matching involved only the eigenvector 
decomposition. Furthermore, noise added due to different lighting conditions does 
not strictly satisfy the condition of localized concentration. The present paper is 
intended to give solid support to the validity and stability of the wavelet-type 
regional matching. 

¹ Detailed tables are omitted from the paper in the interest of space; they may be 
downloaded from the authors' homepage, 
http://alfin.mine.utsunomiya-u.ac.jp/~Ichen/papers/voting-tables.ps, or obtained on 
request from the authors. 